(57) Abstract

A method of producing libraries of genes encoding antigen-combining
molecules or antibodies; a method of producing antigen-combining molecules
which does not require an in vivo procedure; a method of obtaining
antigen-combining molecules of selected specificity which does not require
an in vivo procedure; vectors useful in the present method; and
antigen-combining molecules produced by the method. The antigen-combining
molecules are useful for the detection, quantitation, purification and
neutralization of antigens, as well as for diagnostic, therapeutic and
prophylactic purposes.
Production of antibodies using gene libraries.



Description

Background of the Invention

Monoclonal and polyclonal antibodies are useful for
a variety of purposes. The precise antigen specificity
of antibodies makes them powerful tools that can be used
for the detection, quantitation, purification and
neutralization of antigens.

Polyclonal antibodies are produced in vivo by
immunizing animals, such as rabbits and goats, with
antigens, bleeding the animals and isolating polyclonal
antibody molecules from the blood. Monoclonal antibodies
are produced by hybridoma cells, which are made by
fusing, in vitro, immortal plasmacytoma cells with
antibody-producing cells (Kohler, G. and C. Milstein,
Nature, 256:495 (1975)) obtained from animals immunized
in vivo with antigen.

Current methods for producing polyclonal and monoclonal
antibodies are limited by several factors. First,
methods for producing either polyclonal or monoclonal
antibodies require an in vivo immunization step. This
can be time consuming and require large amounts of
antigen. Second, the repertoire of antibodies expressed
in vivo is restricted by physiological processes, such as
those which mediate self-tolerance that disable
autoreactive B cells (Goodnow, C.C., et al., Nature, 334:676
(1988); Goodnow, J., Basic and Clinical Immunology, Ed.
5, Los Altos, CA, Lange Medical Publications (1984);
Young, C.R., Molecular Immunology, New York, Marcel
Dekker (1984)). Third, although antibodies can exist in
millions of different forms, each with its own unique
binding site for antigen, antibody diversity is
restricted by genetic mechanisms for generating antibody
diversity (Honjo, T., Ann. Rev. Immunol., 1:499 (1983);
Tonegawa, S., Nature, 302:575 (1983)). Fourth, not all
the antibody molecules which can be generated will be
generated in a given animal. As a result, raising high
affinity antibodies to a given antigen can be very time
consuming and can often fail. Fifth, the production of
human antibodies of desired specificity is very
problematical.

A method of producing antibodies which avoids the
limitations of presently-available methods, such as the
requirement for immunization of an animal and in vivo
steps, would be very useful, particularly if it made it
possible to produce a wider range of antibody types than
can be made using presently-available techniques and if
it made it possible to produce human antibody types.

Disclosure of the Invention 

The present invention relates to a method of producing
libraries of genes encoding antigen-combining
molecules or antibodies; a method of producing
antigen-combining molecules, also referred to as antibodies,
which does not require an in vivo procedure, as is
required by presently-available methods; a method of
obtaining antigen-combining molecules (antibodies) of
selected or defined specificity which does not require an
in vivo procedure; vectors useful in the present method;
and antibodies produced or obtained by the method.

The present invention relates to an in vitro process
for synthesizing DNA encoding families of
antigen-combining molecules or proteins. In this process, DNA
containing genes encoding antigen-combining molecules is
obtained and combined with oligonucleotides which are
homologous to regions of the genes which are conserved.
Sequence-specific gene amplification is then carried out
using the DNA containing genes encoding antigen-combining
proteins as template and the homologous oligonucleotides
as primers.

This invention also relates to a method of creating
diverse libraries of DNAs encoding families of
antigen-combining proteins by cloning the product of the in vitro
process for synthesizing DNA, described in the preceding
paragraph, into an appropriate vector (e.g., a plasmid,
viral or retroviral vector).

The subject invention provides an alternative method
for the production of antigen-combining molecules, which
are useful affinity reagents for the detection and
neutralization of antigens and the delivery of molecules
to antigenic sites. The claimed method differs from
production of polyclonal antibody molecules derived by
immunization of live animals and from production of
monoclonal antibody molecules through the use of hybridoma
cell lines in that it does not require an in vivo
immunization step, as do presently available methods.
Rather, diverse libraries of genes which encode
antigen-combining sites comprising a significant proportion of an
animal's repertoire of antibody combining sites are made,
as described in detail herein. These genes are expressed
in living cells, from which molecules of desired
antigenic selectivity can be isolated and purified for
various uses.

Antigen-combining molecules are produced by the
present method in the following manner, which is
described in greater detail below. Initially, a library
of antibody genes which includes a set of variable
regions encoding a large, diverse and random group of
specificities derived from animal or human
immunoglobulins is produced by amplifying or cloning diverse
genomic fragments or cDNAs of antibody mRNAs found in
antibody-producing tissue.
In an optional step, the diversity of the resulting
libraries can be increased by means of random
mutagenesis. The gene libraries are introduced into cultured
host cells, which may be eukaryotic or prokaryotic, in
which they are expressed. Genes encoding antibodies of
desired antigenic specificity are identified, using a
method described herein or known techniques, isolated and
expressed in quantities in appropriate host cells, from
which the encoded antibody can be purified.

Specifically, a library of genes encoding
immunoglobulin heavy chain regions and a library of genes
encoding immunoglobulin light chain regions are
constructed. This is carried out by obtaining
antibody-encoding DNA, which is either genomic fragments or cDNAs
of antibody mRNAs; amplifying or cloning the fragments or
cDNAs; and introducing them into a standard framework
antibody gene vector, which is used to introduce the
antibody-encoding DNA into cells in which the DNA is
expressed. The vector includes a framework gene encoding
a protein, such as a gene encoding an antibody heavy
chain or an antibody light chain, which can be of any
origin (human, non-human) and can be derived from any of
a number of existing DNAs encoding heavy chain
immunoglobulins or light chain immunoglobulins. Such vectors
are also a subject of the present invention and are
described in greater detail in a subsequent section.
Genes from one or both of the libraries are introduced
into appropriate host cells, in which the genes are
expressed, resulting in production of a wide variety of
antigen-combining molecules.

Genes encoding antigen-combining molecules of
desired specificity are identified by identifying cells
producing antigen-combining molecules which react with a
selected antigen and then obtaining the genes of
interest. The genes of interest can subsequently be
introduced into an appropriate host cell (or can be
further modified and then introduced into an appropriate
host cell) for further production of antigen-combining
molecules, which can be purified and used for the same
purposes for which conventionally-produced antibodies are
used.

Through use of the method described, it is possible
to produce antigen-combining molecules which are of wider
diversity than are antibodies available as a result of
known methods; novel antigen-combining molecules with a
diverse range of specificities and affinities; and
antigen-combining molecules which are predominantly human
in origin. Such antigen-combining molecules are a
subject of the present invention and can be used
clinically for diagnostic, therapeutic and prophylactic
purposes, as well as in research contexts, and for other
purposes.

Brief Description of the Drawings

Figure 1 is a schematic representation of the method
of the present invention by which antigen-combining
molecules, or antibodies, are produced.

Figure 2 is a schematic representation of amplification
or cloning of IgM heavy chain variable region DNA
from mRNA, using the polymerase chain reaction.
Panel A shows the relevant regions of the polyadenylated
mRNA encoding the secreted form of the IgM heavy chain.
S denotes the sequences encoding the signal peptide which
causes the nascent peptide to cross the plasma membrane.
V, D and J together comprise the variable region. CH1,
CH2, and CH3 are the three constant domains of Cμ. Hinge
encodes the hinge region. C, B and Z are oligonucleotide
PCR primers (discussed below).

Panel B shows the reverse transcript DNA product of the
mRNA primed by oligonucleotide Z, with the addition of
poly-dC by terminal transferase at the 3' end.
Panel C is a schematic representation of the annealing of
primer A to the reverse transcript DNA.
Panel D shows the final double stranded DNA PCR product
made utilizing primers A and B.
Panel E shows the product of PCR annealed to primer C.
Panel F is a blowup of Panel E, showing in greater detail
the structure of primer C. Primer C consists of two
with far greater diversity than is shown by antibodies
produced by currently-available techniques.

The present invention relates to a method of
producing libraries of genes encoding antigen-combining
molecules (antibody proteins) with diverse
antigen-combining specificities; a method of producing
such antigen-combining molecules; antigen-combining
molecules produced by the method; and vectors useful in
the method. The following is a description of generation
of such libraries, of the present method of producing
antigen-combining molecules of selected specificity and
of vectors useful in producing antigen-combining
molecules of the present invention.

As described below, the process makes use of 
techniques which are known to those of skill in the art 
and can be applied as described herein to produce and 
identify antigen-combining molecules of desired antigenic 
specificity: the polymerase chain reaction (PCR) to 
amplify and clone diverse cDNAs encoding antibody mRNAs 
found in antibody-producing tissue; mutagenesis protocols 
to further increase the diversity of these cDNAs; gene 
transfer protocols to introduce antibody genes into 
cultured (prokaryotic and eukaryotic) cells for the 
purpose of expressing them; and screening protocols to 
detect genes encoding antibodies of the desired antigenic 
specificity. A general outline of the present method is 
represented in Figure 1. 
Construction of Library of Genes Encoding
Antigen-Combining Molecules

A key step in the production of antigen-combining
molecules by the present method is the construction of a
"library" of antibody genes which include "variable"
regions encoding a large, diverse, but random set of
specificities. The library can be of human or non-human
origin and is constructed as follows:

Initially, genomic DNA encoding antibodies or cDNAs
of antibody mRNA (referred to as antibody-encoding DNA)
is obtained. This DNA can be obtained from any source of
antibody-producing cells, such as spleen cells,
peripheral blood cells, lymph nodes, inflammatory tissue
cells and bone marrow cells. It can also be obtained
from a genomic library or cDNA library of B cells. The
antibody-producing cells can be of human or non-human
origin; genomic DNA or mRNA can be obtained directly from
the tissue (i.e., without previous treatment to remove
cells which do not produce antibody) or can be obtained
after the tissue has been treated to increase
concentration of antibody-producing cells or to select a
particular type(s) of antibody-producing cells (i.e.,
treated to enrich the content of antibody-producing
cells). Antibody-producing cells can be stimulated by an
agent which stimulates antibody mRNA production (e.g.,
lipopolysaccharide) before DNA is obtained.

Antibody-encoding DNA is amplified and cloned using
a known technique, such as the PCR using appropriately
selected primers, in order to produce sufficient
quantities of the DNA and to modify the DNA in such a manner
(e.g., by addition of appropriate restriction sites) that
it can be introduced as an insert into an E. coli cloning
vector. This cloning vector can serve as the expression
vector or the inserts can later be introduced into an
expression vector, such as the framework antibody gene
vector described below. Amplified and cloned DNA can be
further diversified, using mutagenesis, such as PCR, in
order to produce a greater diversity or wider repertoire
of antigen-binding molecules, as well as novel
antigen-binding molecules.

Cloned antibody-encoding DNA is introduced into an
expression vector, such as the framework antibody gene
vector of the present invention, which can be a plasmid,
viral or retroviral vector. Cloned antibody-encoding DNA
is inserted into the vector in such a manner that the
cloned DNA will be expressed as protein in appropriate
host cells. It is essential that the expression vector
used make it possible for the DNA insert to be expressed
as a protein in the host cell. One expression vector
useful in the present method is referred to as the
framework antibody gene vector. Vectors useful in the
present method contain antibody constant region or
portions thereof in such a manner that when amplified DNA
is inserted, the vector expresses a chimeric gene product
comprising a variable region and a constant region in
proper register. The two regions present in the chimeric
gene product can be from the same type of immunoglobulin
molecule or from two different types of immunoglobulin
molecules.

These libraries of antibody-encoding genes are then
expressed in cultured cells, which can be eukaryotic or
prokaryotic. The libraries can be introduced into host
cells separately or together. Introduction of the
antibody-encoding DNA in vitro into host cells (by
infection, transformation or transfection) is carried out
using known techniques, such as electroporation,
protoplast fusion or calcium phosphate co-precipitation.
If only one library is introduced into a host cell, the
host cell will generally be one which makes the other
antibody chain, thus making it possible to produce
complete/functional antigen-binding molecules. For
example, if a heavy chain library produced by the present
method is introduced into host cells, the host cells will
generally be cultured cells, such as myeloma cells or E.
coli, which naturally produce the other (i.e., light)
chain of the immunoglobulin or are engineered to do so.
Alternatively, both libraries can be introduced into
appropriate host cells, either simultaneously or
sequentially.

Host cells in which the antibody-encoding DNA is
expressed can be eukaryotic or prokaryotic. They can be
immortalized cultured animal cells, such as a myeloma
cell line which has been shown to efficiently express and
secrete introduced immunoglobulin genes (Morrison, S.L.,
et al., Ann. N.Y. Acad. Sci., 507:187 (1987); Kohler, G.
and C. Milstein, Eur. J. Immunol., 6:511 (1976); Oi,
V.T., et al., Immunoglobulin Gene Expression in
Transformed Lymphoid Cells, Proc. Natl. Acad. Sci. USA,
80:825 (1983); Davis, A.C.
and M.J. Shulman, Immunol. Today, 10:119 (1989)). One
host cell which can be used to express the
antibody-encoding DNA is the J558L cell line or the SP2/0 cell
line.
Cells expressing antigen-combining molecules with a
desired specificity for a given antigen can then be
selected by a variety of means, such as testing for
reactivity with a selected antigen using nitrocellulose
layering. The antibodies identified thereby can be of
human origin, nonhuman origin or a combination of both.
That is, all or some of the components (e.g., heavy
chain, light chain, variable regions, constant regions)
can be encoded by DNA of human or nonhuman origin, which,
when expressed, produces the encoded chimeric protein
which, in turn, may be human, nonhuman or a combination
of both. In such antigen-combining molecules, all or
some of the regions (e.g., heavy and light chain variable
and constant regions) are referred to as being of human
origin or of nonhuman origin, based on the source of the
DNA encoding the antigen-combining molecule region in
question. For example, in the case in which DNA encoding
mouse heavy chain variable region is expressed in host
cells, the resulting antigen-combining molecule has a
heavy chain variable region of mouse origin. Antibodies
produced may be used for such purposes as drug delivery,
tumor imaging and other therapeutic, diagnostic and
prophylactic uses.

Once antibodies of a desired binding specificity are
obtained, their genes may be isolated and further
mutagenized to create additional antigen-combining
diversity or antibodies of higher affinity for antigen.
Construction of Immunoglobulin Gene Library and
Production of Encoded Antigen-Combining Molecules

The following is a detailed description of a
specific experimental protocol which embodies the
concepts described above. Although the following is a
description of one particular embodiment, the same
procedures can be used to produce libraries in which the
immunoglobulin and the heavy chain class are different or
in which light chain genes are amplified and cloned. The
present invention is not intended to be limited to this
example. In the embodiment presented below, a diverse
heavy chain gene library is constructed. Using the
principles described in relation to the heavy chain gene
library, a diverse light chain gene library is also
constructed. These are co-expressed in an immortal tumor
cell capable of producing antibodies, such as
plasmacytoma cells or myeloma cells. Cells expressing antibody
reactive to antigen are identified by a nitrocellulose
filter overlay and antibody is prepared from cells
identified as expressing it. As described in a
subsequent section, there are alternative methods of library
construction, other expression systems which can be used
and alternative selection systems for identifying
antibody-producing cells or viruses.

Step 1 in this specific protocol is construction of
libraries of genes in E. coli which encode immunoglobulin
heavy chains. This is followed by the use of random
mutagenesis to increase the diversity of the library,
which is an optional procedure. Step 2 is introduction
of the library, by transfection, into myeloma cells.
Step 3 is identification of myeloma cells expressing
antibody with the desired specificity, using the
nitrocellulose filter overlay technique or techniques
known to those of skill in the art. Step 4 is isolation
of the gene(s) encoding the antibody with the desired
specificity and their expression in appropriate host
cells, to produce antigen-combining fragments useful for
a variety of purposes.

Construction

One key step in construction of the library of cDNAs
encoding the variable region of mouse heavy chain genes
is construction of an E. coli plasmid vector, designated
pFHC. pFHC contains a "framework" gene, which can be
any antibody heavy chain and serves as a site into which
the amplified cloned gene product (genomic DNA or cDNA of
antibody mRNAs) is introduced. pFHC is useful as a
vector for this purpose because it contains RE1 and RE2
cloning sites. Other vectors which include a framework
gene and other cloning sites can be used for this purpose
as well. The framework gene includes a transcriptional
promoter (e.g., a powerful promoter, such as a Moloney
LTR (Mulligan, R.C., In Experimental Manipulation of Gene
Expression, New York, Academic Press, p. 155 (1983))) and a
Cμ chain transcriptional enhancer to increase the level
of transcription from the promoter (Gillies, S.D., et
al., Cell, 33:717 (1983)); a cloning site containing RE1
and RE2; part of the Cμ heavy chain gene encoding
secreted protein; and poly A addition and termination
sequences (Figure 3). The framework antibody gene vector
of the present invention (pFHC) also includes a
selectable marker (e.g., an antibiotic resistance gene
such as the neomycin resistance gene, neoR) for animal
cells; sequences for bacterial replication (ori); and a
selectable marker (e.g., the ampicillin resistance gene,
AmpR) for bacterial cells. The framework gene can be of
any origin (human, non-human), and can derive from any
one of a number of existing DNAs encoding heavy chain
immunoglobulins (Tucker, P.W., et al., Science, 206:1299
(1979); Honjo, T., et al., Cell, 18:559 (1979); Bothwell,
A.L.M., et al., Cell, 24:625 (1981); Liu, A.Y., et al.,
Gene, 54:33 (1987); Kawakami, T., et al., Nuc. Acids
Res., 8:3933 (1980)). In this embodiment, the vector
retains the introns between the CH1, hinge, CH2 and CH3
exons. The "variable region" of the gene, which includes
the V, D and J regions of the antibody heavy chain and
which encodes the antigen binding site, is deleted and
replaced with two consecutive restriction endonuclease
cloning sites, RE1 and RE2. The restriction endonuclease
site RE1 occurs just 3' to the LTR promoter and the
restriction endonuclease site RE2 occurs within the
constant region just 3' to the J region (see Figure 3).
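
The layout of a framework vector of this kind can be sketched in
code. The following is only an illustrative model, not the actual
map of pFHC: the only ordering taken from the text is promoter,
then RE1, then RE2, then constant-region exons, then poly A
sequences. The class name, the function name and the position of
the enhancer and marker entries are assumptions made for the
sketch.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Feature:
    name: str
    role: str


# Hypothetical, simplified feature order for a pFHC-like vector.
# Placement of the enhancer, ori and marker genes is an assumption.
PFHC_LIKE = [
    Feature("LTR promoter", "transcriptional promoter"),
    Feature("RE1", "cloning site just 3' to the promoter"),
    Feature("RE2", "cloning site just 3' to where J would lie"),
    Feature("C-mu enhancer", "raises transcription (position assumed)"),
    Feature("C-mu constant exons", "framework constant region"),
    Feature("poly A / termination", "transcript processing"),
    Feature("neoR", "selectable marker for animal cells"),
    Feature("ori", "bacterial replication"),
    Feature("AmpR", "selectable marker for bacterial cells"),
]


def insert_variable_region(vector, insert_name):
    """Return a new feature list with the insert between RE1 and RE2."""
    names = [f.name for f in vector]
    i, j = names.index("RE1"), names.index("RE2")
    if j != i + 1:
        raise ValueError("RE1 and RE2 must be consecutive cloning sites")
    insert = Feature(insert_name, "amplified variable region (V-D-J)")
    return vector[: i + 1] + [insert] + vector[j:]


if __name__ == "__main__":
    for feature in insert_variable_region(PFHC_LIKE, "VDJ insert"):
        print(f"{feature.name:22s} {feature.role}")
```

The model is deliberately limited to ordering; it says nothing
about reading frame or about the actual recognition sequences of
RE1 and RE2, which the text does not specify here.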

Another key step in the production of
antigen-combining molecules in this embodiment of the present
invention is construction in an E. coli vector of a
library of cDNAs encoding the variable region of mouse
immunoglobulin genes. In this embodiment, the pFHC
vector, which includes cloning sites designated RE1 and
RE2, is used for cloning heavy chain variable regions,
although any cloning vector with cloning sites having the
same or similar characteristics (described below) can be
used. Similarly, a light chain vector can be designed,
using the above described procedures and procedures known
to a person of ordinary skill in the art.

In this embodiment, non-immune mouse spleens are
used as the starting material. mRNA is prepared directly
from the spleen or from spleen processed in such a manner
that it is enriched for resting B cells. Enrichment of
tissue results in a more uniform representation of
antibody diversity in the starting materials.
Lymphocytes can be purified from spleen using ficoll
gradients (Boyum, A., Scand. J. of Clinical Invest.,
21:77 (1968)). B cells are separated from other cells
(e.g., T cells) by panning with anti-IgM coated dishes
(Wysocki, L.J. and V.L. Sato, Proc. Natl. Acad. Sci.,
75:2844 (1978)). Because activated cells express the
IL-2 receptor but resting B cells do not, resting B cells
can be separated yet further from activated cells by
panning. Further purification by size fractionation on a
Cell Sorter results in a fairly homogeneous population of
resting B cells.

Poly A+ mRNA from total mouse spleen is prepared
according to published methods (Sambrook, J., et al.,
Molecular Cloning: A Laboratory Manual, 2d Ed., Cold
Spring Harbor Laboratory Press, Cold Spring Harbor, NY
(1989)). Production of antibody mRNA can first be
stimulated by lipopolysaccharide (LPS) (Andersson, J.A.,
et al., J. Exp. Med., 145:1511 (1977)). First strand
cDNA is prepared to this mRNA population using as primer
an oligonucleotide, Z, which is complementary to Cμ in
the CH1 region 3' to J. This primer is designated Z in
Figure 2. First strand cDNA is then elongated by the
terminal transferase reaction with dCTP to form a poly dC
tail (Sambrook, J., et al., Molecular Cloning: A
Laboratory Manual, 2d Ed., Cold Spring Harbor Laboratory
Press, Cold Spring Harbor, NY (1989)).

This DNA product is then used as template in a
polymerase chain reaction (PCR) to amplify cDNAs encoding
antibody variable regions (Saiki, R.K., et al., Science,
239:487 (1988); Ohara, O., et al., Proc. Natl. Acad. Sci.
USA, 86:5673 (1989)). Initially, PCR is carried out with
two primers: primer A and primer B, as represented in
Figure 2. Primer A contains the RE1 site at its 5' end,
followed by poly dG. Primer B is complementary to the
constant (CH1) region of the Cμ gene, 3' to the J region
and 5' to primer Z (see Figure 2). Primer B is
complementary to all Cμ genes, which encode the heavy
chain of molecules of the IgM class, the Ig class
expressed by all B cell clones prior to class switching
(Schimizu, A. and T. Honjo, Cell, 36:801-803 (1984)) and
present in resting B cells. The resultant PCR product
includes a significant proportion of cDNAs encompassing
the various regions expressed as IgM in the mouse.
(Other primers, complementary to the cDNA genes
encoding the constant regions of other immunoglobulin
heavy chains, can be used in parallel reactions to obtain
the variable regions expressed on these molecules, but
for simplicity these are not described.)
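
The relationship between the poly dC tail on the first strand
cDNA and the poly dG stretch of primer A can be illustrated with
a short sketch. Everything specific in it is an assumption made
for illustration: the text does not name the RE1 enzyme or give
tail lengths, so the EcoRI sequence, the tail lengths and the
function names below are placeholders only.

```python
# Placeholder only: the text does not say which enzyme RE1 is.
HYPOTHETICAL_RE1_SITE = "GAATTC"

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (5'->3')."""
    return seq.translate(_COMPLEMENT)[::-1]


def tail_first_strand(cdna, dc_len=20):
    """Model terminal transferase tailing: add poly dC at the 3' end."""
    return cdna + "C" * dc_len


def make_primer_a(re1_site, dg_len=15):
    """Primer A: RE1 site at the 5' end, followed by poly dG."""
    return re1_site + "G" * dg_len


def primer_anneals_to_3prime_end(primer, template, min_match=12):
    """True if the primer's 3' end pairs with the template's 3' end.

    Pairing is antiparallel, so the last `min_match` bases of the
    primer must equal the reverse complement of the last
    `min_match` bases of the template.
    """
    if len(primer) < min_match or len(template) < min_match:
        return False
    return reverse_complement(template[-min_match:]) == primer[-min_match:]


if __name__ == "__main__":
    primer_a = make_primer_a(HYPOTHETICAL_RE1_SITE)
    tailed = tail_first_strand("ATGCATGCATGCATGC")
    print(primer_a)
    print(primer_anneals_to_3prime_end(primer_a, tailed))
```

In this model the RE1 site does not pair with the template; it
overhangs the tailed end and is carried into the product on
extension, which is how the cloning site ends up flanking the
amplified variable region.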

Next, the product of the first PCR procedure is used
again for PCR with primer A and primer C. Primer C, like
primer B, is complementary to the Cμ gene 3' to J and
just 5' to primer B (see Figure 2). Primer C contains
the RE2 site at its 5' end. The RE2 sequence is chosen
in such a manner that when it is incorporated into the
with RE1 and RE2, is recloned into the framework vector
pFHC. To the extent that mutation affects codons of the
antigen binding region, this procedure increases the
diversity of the binding domains. For example, if the
starter library has a complexity of 10^[exponent
illegible] elements, and an
average of one mutation is introduced per complementarity
determining region, and it is assumed that the
complementarity determining region is 40 amino acids in
size and that any of six amino acid substitutions can
occur at a mutated codon, the diversity of the library
can be increased by a factor of about 40 x 6, or 240, for
single amino acid changes and 240 x 240, or about
6 x 10^4, for double amino acid changes, yielding a final
diversity of approximately 10^[exponent illegible]. This
is considered to
be in the range of the diversity of antibodies which
animals produce (Tonegawa, S., Nature, 302:575 (1983)).
Even greater diversity can be generated by the random
combination of H and L chains, the result of
co-expression in host cells (see below). It is, thus,
theoretically possible to generate a more diverse antibody
library in vitro than can be generated in vivo. This
library of genes is called the "high diversity" heavy
chain library. It may be propagated indefinitely in E.
coli. A high diversity light chain library can be
prepared similarly.
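
The arithmetic in the diversity estimate above can be checked
with a few lines of code. This is a sketch of the estimate as
stated, not a rigorous count: the 240 x 240 figure for double
changes is the text's own approximation and is reproduced as
such. The starter complexity and light chain library size passed
in at the bottom are arbitrary illustrative values, not figures
taken from the text.

```python
def mutational_multipliers(cdr_length=40, substitutions_per_codon=6):
    """Return (single, double) diversity multipliers.

    single: one changed position out of `cdr_length`, each with
            `substitutions_per_codon` possible substitutions.
    double: the text's approximation, single x single.
    """
    single = cdr_length * substitutions_per_codon
    double = single * single
    return single, double


def mutagenized_diversity(starter_complexity, multiplier):
    """Upper-bound diversity after mutagenesis of a starter library."""
    return starter_complexity * multiplier


def paired_diversity(heavy_diversity, light_diversity):
    """Upper bound on distinct H/L pairs from random co-expression."""
    return heavy_diversity * light_diversity


if __name__ == "__main__":
    single, double = mutational_multipliers()
    print(single)   # 40 x 6
    print(double)   # 240 x 240, about 6 x 10^4

    # Illustrative inputs only; these values are assumptions.
    example_starter = 10**6
    example_light = 10**4
    heavy = mutagenized_diversity(example_starter, double)
    print(heavy)
    print(paired_diversity(heavy, example_light))
```

These products are upper bounds: they assume every mutant and
every heavy/light pairing is distinct and actually represented in
the library, which a real library of finite size need not satisfy.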

The framework vector for the light chain library,
designated pFLC, includes components similar to those in
the vector for the heavy chain library: the enhancer,
promoter, a bacterial selectable marker, an animal
selectable marker, bacterial origin of replication and
light chain exons encoding the constant regions. For
pFLC, the animal selectable marker should differ from the
animal selectable marker in pFHC. For example, if pFHC
contains neo, pFLC can contain Eco gpt.

A light chain library, which contains diverse light
chain fragments, is prepared as described above for
construction of the heavy chain library. In constructing
the light chain library, the primers used are different
from those described above for heavy chain library
construction. In this instance, the primers are
complementary to light chain mRNA encoding constant
regions. The framework vector contains the light chain
constant region exons.

Introduction of the Library of Immunoglobulin Chain Genes
into Immortalized Animal Cells

The library of immunoglobulin chain genes produced
as described is subsequently introduced into a line of
immortalized cultured animal cells, referred to as the
"host" cells, in which the genes in the library are
expressed. Particularly useful for this purpose are
plasmacytoma cell lines or myeloma cell lines which have
been shown to efficiently express and secrete introduced
immunoglobulin genes (Morrison, S.L., et al., Ann. N.Y.
Acad. Sci., 507:187 (1987); Kohler, G. and C. Milstein,
Eur. J. Immunol., 6:511 (1976); Galfre and C. Milstein,
Methods Enzymol., 73:3 (1981); Davis, A.C. and M.J.
Shulman, Immunol. Today, 10:119 (1989)). For example,
the J558L cell line can be cotransfected using
electroporation or protoplast fusion (Morrison, S.L., et al.,
Ann. N.Y. Acad. Sci., 507:187 (1987)) and transfected
cells selected on the basis of auxotrophic markers
present on light and heavy chain libraries.

As a result of cotransformation and selection for
markers on both light chain and heavy chain vectors, most
transformed host cells will express several copies of
immunoglobulin heavy and light chains from the diverse
library, and will express chimeric antibodies (antibodies
encoded by all or part of two or more genes) (Nisonoff,
In The Antibody Molecule, Academic Press, NY,
p. 238 (1975)). These chimeric antibodies are of two
types: those in which one chain is encoded by a host
cell gene and the other chain is encoded by an
exogenously introduced antibody gene and those in which both
the light and the heavy chain are encoded by an exogenous
antibody gene. Both types of antibodies will be
secreted. A library of cells producing antibodies of
diverse specificities is produced as a result. The
library of cells can be stored and maintained
indefinitely by continuous culture and/or by freezing. A
virtually unlimited number of cells can be obtained by
this process.

Isolation of Cells Producing Antigen-Binding Molecules of
Selected Specificity

Cells producing antigen-binding molecules of
selected specificity (i.e., which bind to a selected
antigen) can be identified and isolated using
nitrocellulose filter layering or known techniques. The
same methods employed to identify and isolate hybridoma
cells producing a desired antibody can be used: cells
are pooled and the supernatants tested for reactivity
with antigen (Harlow, E. and D. Lane, Antibodies: A
Laboratory Manual, Cold Spring Harbor Laboratory, N.Y.,
p. 283 (1988)). Subsequently, individual clones of cells
are identified, using known techniques. A preferred
method for identification and isolation of cells makes
use of nitrocellulose filter overlays, which allow the
screening of a large number of cells. Cells from the
library of transfected myeloma cells are seeded in 10 cm
petri dishes in soft agar (Cook, W.D. and M.D. Scharff,
Proc. Natl. Acad. Sci. USA, 74:5687 (1977); Paige, C.J.,
et al., Methods in
Enzymol., 150:257 (1987)) at a density of 10^[exponent
illegible] colony
forming units, and allowed to form small colonies
(approximately 300 cells). A large number of dishes
(>100) may be so seeded. Cells are then overlaid with a
thin film of agarose (<1 mm) and the agarose is allowed to
harden. The agarose contains culture medium without
serum. Nitrocellulose filters (or other protein-binding
filters) are layered on top of the agarose, and the
dishes are incubated overnight. During this time,
antibodies secreted by the cells will diffuse through the
agarose and adhere to the nitrocellulose filters. The
nitrocellulose filters are keyed to the underlying plate
and removed for processing.

The method for processing nitrocellulose filters is
identical to the methods used for Western blotting
(Harlow, E. and D. Lane, Antibodies: A Laboratory Manual,
Cold Spring Harbor, N.Y., p. 283 (1988)). The antibody
molecules are adsorbed to the nitrocellulose filter. The
filters, as prepared above, are then blocked. The
desired antigen, for example, keyhole limpet hemocyanin
(KLH), which has been iodinated with radioactive ¹²⁵I, is
then applied in Western blotting buffers to the filters.
(Other, nonradiographic methods can be used for
detection.) After incubation, the filters are washed and
dried and used to expose autoradiography film according
to standard procedures. Where the filters have adsorbed
antibody molecules which are capable of binding KLH, the
autoradiography film will be exposed. Cells expressing
the KLH-reactive antibody can be identified by
determining the location on the dish corresponding to an
exposed filter; cells identified in this manner can be
isolated using known techniques. Cells which are
isolated from a region of the dish can then be
rescreened, to ensure the isolation of the clone of
antigen-binding molecule-producing cells.

Isolation of Genes Encoding Antigen-Binding Molecules of
Selected Specificity and Purification of Encoded
Antigen-Binding Molecules

The gene(s) encoding an antigen-binding molecule of
selected specificity can be isolated. This can be
carried out, for example, as follows: primers D and C
(see Figures 2 and 3) are used in a polymerase chain
reaction to produce all the heavy chain variable region
genes introduced into the candidate host cell from the
library. These genes are cloned again in the framework
vector pFHC at the RE1 and RE2 sites. Similarly, all the
light chain regions introduced into the host cell from
the library are cloned into the light chain vector, pFLC.
Members of the family of vectors so obtained are then
transformed pairwise into myeloma cells, which are tested
for the ability to produce and secrete the antibody with
the desired selectivity. Purification of the antibody
from these cells can then be accomplished using standard
procedures (Johnstone, A. and R. Thorpe, Immunochem. in
Practice, Blackwell Scientific, Oxford, p. 27 (1982);
Harlow, E. and D. Lane, Antibodies: A Laboratory Manual,
Cold Spring Harbor Laboratory, N.Y., p. 283 (1988)).

Alteration of Affinity of Antigen-Binding Molecules

It is also possible to produce antigen-binding
molecules whose affinity for a selected antigen is
altered (e.g., different from the affinity of a
corresponding antigen-binding molecule produced by the
present method). This can be carried out, for example,
to increase the affinity of an antigen-binding molecule
by randomly mutagenizing the genes isolated as described
above using previously described mutagenesis methods.
Alternatively, the variable region of antigen-binding
molecule-encoding genes can be sequenced and site
directed mutagenesis performed to mutate the
complementarity determining regions (CDR) (Kabat, E.A.,
J. Immunol., 141:S25-36 (1988)). Both processes result in
production of a sublibrary of genes which can be screened
for antigen-binding molecules of higher affinity or of
altered affinity after the genes are expressed in myeloma
cells.
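The site directed route to a sublibrary described above can be illustrated with a minimal sketch. It assumes the variable region has already been sequenced and that the CDR boundaries are known; the function name, the toy sequence and the coordinates are hypothetical and are not taken from the specification. The sketch simply enumerates every single-nucleotide substitution inside one CDR window, leaving the framework untouched.

```python
def cdr_point_mutants(gene: str, cdr_start: int, cdr_end: int) -> list[str]:
    """Return every single-nucleotide variant of `gene` whose change lies
    within the half-open window [cdr_start, cdr_end) (0-based indices).

    The window stands in for one complementarity determining region;
    its coordinates are an assumption supplied by the caller.
    """
    gene = gene.upper()
    variants = []
    for position in range(cdr_start, cdr_end):
        for base in "ACGT":
            if base != gene[position]:
                # Substitute one base and keep the rest of the gene unchanged.
                variants.append(gene[:position] + base + gene[position + 1:])
    return variants


if __name__ == "__main__":
    # Toy example: a nine-nucleotide "gene" with a three-nucleotide CDR window.
    sublibrary = cdr_point_mutants("ATGGCCAAA", 3, 6)
    print(len(sublibrary), "variants")
```

A window of n nucleotides yields 3n variants, so the size of such a sublibrary grows only linearly with CDR length, whereas random mutagenesis of the whole gene does not confine changes to the CDRs.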

Alternative Materials and Procedures for Use in the
Present Method

In addition to those described above for use in the
method of the present invention, other materials (e.g.,
starting materials, primers) and procedures can be used
in carrying out the method. For example, use of PCR
technology to clone a large collection of cDNA genes
encoding variable regions of heavy chains has been
described above. Although primers from the Cμ class were
described as being used in unidirectional nested PCR, the
present invention is not limited to these conditions.
For example, primers from any of the other heavy chain
classes (the Cγ subclasses or Cα, for example) or from
light chains can be used. Cμ was described as of particular
use because of the fact that the entire repertoire of
heavy chain variable regions is initially expressed as
IgM. Only following heavy chain class switching are
these variable regions expressed with a heavy chain of a
different class (Shimizu, A. and T. Honjo, Cell,
36:801-803 (1984)). In addition, the predominant
population of B cells in nonimmune spleen cells is
IgM⁺ cells (Cooper, M.D. and P. Burrows, In
Immunoglobulin Genes, Academic Press, N.Y., p. 1 (1989)).
Although unidirectional nested PCR amplification is
described above, other PCR procedures, as well as other
DNA amplification techniques, can be used to amplify DNA
as needed in the present invention. For example,
bidirectional PCR amplification of antibody variable
regions can be carried out. This approach requires use
of multiple degenerate 5' primers (Orlandi, R., et al.,
Proc. Natl. Acad. Sci. USA, 86:3833 (1989); Sastry, L.,
et al., Proc. Natl. Acad. Sci. USA, 86:5728 (1989)).
Bidirectional amplification may not pick up the same full
diversity of genes as can be expected from unidirectional
PCR.

In addition, methods of introducing further
diversity into the antibody library other than the method
for random mutagenesis utilizing PCR described above can
be used. Other methods of random mutagenesis, such as
that described by Sambrook, et al. (Sambrook, J., et al.,
Molecular Cloning: A Laboratory Manual, Cold Spring
Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1989))
can be used, as can direct mutagenesis of the
complementarity determining regions (CDRs).

Framework vectors other than one using a mouse Cμ
heavy chain constant region, which contains the Cμ
enhancer and introns and a viral promoter (described
previously), can be used for inserting the products of
PCR. The vectors described were chosen for their
subsequent use in the expression of the antibody genes,
but any eukaryotic or prokaryotic cloning vector could be
used to create a library of diverse cDNA genes encoding
variable regions of antibody molecules. The inserts from
this vector could be transferred to any number of
expression vectors. For example, other framework vectors
which include intronless genes can be constructed, as can
other heavy chain constant regions. In addition to
plasmid vectors, viral vectors or retroviral vectors can
be used to introduce genes into myeloma cells.

The source for antibody molecule mRNAs can also be
varied. Purified resting B lymphocytes from mouse
nonimmunized spleen are described above as such a source.
However, total spleens (immunized or not) from other
animals, including humans, can be used, as can any source
of antibody-producing cells (e.g., peripheral blood,
lymph nodes, inflammatory tissue, bone marrow).

Introduction of H and L chain gene DNA into myeloma
cells using cotransformation by electroporation or
protoplast fusion methods is described above (Morrison,
S.L. and V.T. Oi, Adv. Immunol., 44:65 (1989)). However,
any means by which DNA can be introduced into living
cells in vivo can be used, provided that it does not
significantly interfere with the ability of the
transformed cells to express the introduced DNA. In
fact, a method other than cotransformation can be used.
Cotransfection was chosen for its simplicity, and because
both the H and L chains can be introduced into myeloma
cells. It may be possible to introduce only the H chain
into myeloma cells. Moreover, the H chain itself in many
cases carries sufficient binding affinity for antigen.
However, other methods can also be used. For example,
retroviral infection may be used. Replication-incompetent
retroviral vectors can be readily constructed which
can be packaged into infective particles by helper cells
(Mann, R., et al., Cell, 33:153-159 (1983)). Viral
titers of 10^ infectious units per ml can be achieved,
making possible the transfer of very large numbers of
genes into myeloma cells.

Greater diversity of antibody-producing cells than
results from the method described above can be generated
if light and heavy chain genes are introduced separately
into myeloma cells. Light chain genes can be introduced
into one set of myeloma cells with one selectable marker,
and heavy chains into another set of cells with a
different selectable marker. Myeloma cells containing and
expressing both H and L chains could then be generated by
the highly efficient process of polyethylene glycol
mediated cell fusion (Pontecorvo, G., Somatic Cell
Genetics, 1:397 (1975)). Thus, a method of screening
diverse libraries of antibody genes using animal cells is
not limited by the number of cells which can be
generated, but by the number of cells which can be
screened.

Methods other than the nitrocellulose filter overlay
technique described above can be used to identify cells
expressing an antigen-binding molecule of selected
specificity. An important characteristic of any method is
that it be useful to screen large numbers of different
antibodies. With the nitrocellulose filter overlay
technique, for example, if 300 dishes are prepared and
10⁴ independent transformed host cells per dish are
screened, and if, on average, each cell produces ten
different antibody molecules, then 300 x 10⁴ x 3, or
about 10⁷ different antibodies can be screened at once.
However, if the antibody molecules can be displayed on the
cell surface, still larger numbers of cells can be
screened using affinity matrices to pre-enrich for
antigen-binding cells. There are immortal B cell lines,
such as BCL1B1, which will express IgM both on the cell
surface and as a secreted form (Granowicz, E.S., et al.,
J. Immunol., 125:976 (1980)). If such cells are infected
by retroviral vectors containing the terminal Cμ exons,
the infected cells will likely produce both secreted and
membrane-bound forms of IgM (Webb, C.F., et al.,
J. Immunol., 143:3934-3939 (1989)). Still other methods
can be used to detect antibody production. If the host
cell is E. coli, a nitrocellulose overlay is possible, and
such methods have been frequently used to detect E. coli
producing particular proteins (Sambrook, J., et al.,
Molecular Cloning: A Laboratory Manual, Cold Spring
Harbor Laboratory Press, Cold Spring Harbor, N.Y.
(1989)). Other methods of detection are possible and one
in particular, which involves the concept of "viral
coating", is discussed below.
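The screening arithmetic given above for the filter overlay technique can be checked with a short calculation. The sketch below is illustrative only: the function and parameter names are hypothetical, and the number of antibodies per cell is left as a parameter so that the estimate can be recomputed for different assumptions.

```python
def antibodies_screened(dishes: int, cells_per_dish: int,
                        antibodies_per_cell: int) -> int:
    """Upper bound on distinct antibodies examined in one overlay screen,
    assuming every colony and every antibody species is independent."""
    return dishes * cells_per_dish * antibodies_per_cell


if __name__ == "__main__":
    # 300 dishes at 10**4 colonies per dish, for two per-cell assumptions.
    for per_cell in (3, 10):
        total = antibodies_screened(300, 10**4, per_cell)
        print(f"{per_cell} antibodies per cell: {total:.0e} per screen")
```

Either assumption gives a total on the order of 10⁷, which is the figure quoted for the overlay technique; the viral coating approach discussed next is aimed at numbers several orders of magnitude larger.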

Viral coating can be used as a means of identifying
viruses encoding antigen-combining molecules. In this
method, a viral vector is used to direct the synthesis of
diverse antibody molecules. Upon lytic infection of host
cells, and subsequent cell lysis, the virus becomes
"coated" with the antibody product it directs. That is,
the antibody molecule becomes physically linked to the
outside of a mature virus particle, which can direct its
synthesis. Methods for viral coating are described
below. Viruses coated by antibody can be physically
selected on the basis of their affinity to antigen which
is attached to a solid support. The number of particles
which can be screened using this approach is well in
excess of 10⁹ and it is possible that 10¹¹ different
antibody genes could be screened in this manner. In one
embodiment, an affinity matrix containing antigen is used
to purify those viruses which encode antibody molecules
with affinity to the antigen and whose surface is coated
with the antibodies they encode.

One method of viral coating is as follows: A
diverse library of bacteriophage λ encoding parts of
antibody molecules that are expressed in infected E. coli
and which retain the ability to bind antigens is created,
using known techniques (Orlandi, R., et al., Proc. Natl.
Acad. Sci. USA, 86:3833 (1989); Huse, W.D., et al.,
Science, 246:1275 (1989); Better, M., et al., Science,
240:1041 (1988); Skerra, A. and A. Pluckthun, Science,
240:1038 (1988)). Bacteria infected with phage are
embedded in a thin film of semisolid agar. Greater than
10 infected bacteria may be plated in the presence of an
excess of uninfected bacteria in a volume of 1 ml of agar
and spread over a 10 cm² surface. The agar contains
monovalent antibody "A" (Parham, P., In Handbook of
Experimental Immunology: Immunochem., Blackwell
Scientific Publishers, Cambridge, MA, pp. 14.1-14.23
(1986)), which can bind the λ coat proteins and which has
been chemically coupled to monovalent antibody "B", which
can bind an epitope on all viral directed antibody
molecules. Monovalent antibodies are used to prevent the
crosslinking of viral particles. Upon lytic burst,
progeny phage particles become effectively cross linked
to the antibody molecule they encode. Because lysis
occurs in semisolid medium, in which diffusion is slow,
cross linking between a given phage and the antibody
encoded by another phage is minimized. A nitrocellulose
filter (or other protein binding filter) is prepared as
an affinity matrix by adsorbing the desired antigen. The
filter is then blocked so that no other proteins bind
nonspecifically. The filter is overlaid upon the agar,
and coated phage are allowed to bind to the antigen by
way of their adherent antibody molecules. Filters are
washed to remove nonspecifically bound phage.
Specifically bound phage therefore represent phage
encoding antibodies with the desired specificity. These
can now be propagated by reinfection of bacteria.

Thus the present invention makes it possible to
produce antigen-binding molecules which, like antibodies
produced by presently-available techniques, bind to a
selected antigen (i.e., having binding specificity).
Antibodies produced as described can be used, for
example, to detect and neutralize antigens and deliver
molecules to antigenic sites.

EXAMPLE 1 Amplification of IgM Heavy Chain Variable
Region DNA from mRNA
IgM heavy chain variable DNA is amplified from mRNA
by the procedure represented schematically in Figure 2.
In Figure 2, Panel A depicts the relevant regions of the
polyadenylated mRNA encoding the secreted form of the
IgM heavy chain. In Panel A, S denotes the sequences
encoding the signal peptide which causes the nascent
peptide to cross the plasma membrane, a necessary step in
the processing and secretion of the antibody. V, D and J
derive from separate exons and together comprise the
variable region. CH1, CH2, and CH3 are the three constant
domains of Cμ. "Hinge" encodes the hinge region. C, B
and Z are oligonucleotide PCR primers used in the
amplification process. The only constraints on Primers B
and Z are that they are complementary to the mRNA, and
occur in the order shown relative to C. Primer C, in
addition to being complementary to mRNA, has an extra bit
of sequence at its 5' end which allows the cloning of its
PCR product. This is described below. Panel B depicts
the reverse transcript DNA product of the mRNA primed by






oligonucleotide Z, with the addition of poly-dC by
terminal transferase at the 3' end of the product. Panel
C depicts the annealing of primer A to the reverse
transcript DNA represented in Panel B. Primer A contains
the restriction endonuclease site RE1, with additional
DNA at its 5' end. The constraints on the RE1 site are
described in Example 2. Panel D depicts the final double
stranded DNA PCR product made utilizing primers A and B.
Panel E depicts the PCR product shown in Panel D annealed
to Primer C. Panel F is a blow up of Panel E showing the
structure of primer C. Primer C consists of two parts:
a 3' part complementary to IgM heavy chain mRNA as shown,
and a 5' part which contains restriction site RE2 and
spacer. Constraints on RE2 are described in Example 2.
Panel G depicts the final double stranded DNA PCR product
utilizing Primers A and C and the product of the previous
PCR (depicted in Panel D) as template. The S, V, D, J
regions are again depicted.

EXAMPLE 2 Construction of Heavy Chain Framework Vector
pFHC

A heavy chain framework vector, designated pFHC, is
constructed, using known techniques (See Figure 3). It
is useful for introducing antibody-encoding DNA into host
cells, in which the DNA is expressed, resulting in
antibody production. The circular plasmid (above) is
depicted linearized (below) and its relevant components
are shown. The neomycin antibiotic resistance gene
(neo) is useful for selecting transformed animal cells
(Sambrook, J., et al., Molecular Cloning: A Laboratory
Manual, 2d Ed., Cold Spring Harbor Laboratory Press, Cold
Spring Harbor, NY (1989)). The bacterial replication
origin and ampicillin antibiotic resistance genes, useful
respectively for replication in E. coli and rendering
E. coli resistant to ampicillin, can derive from any number
of bacterial plasmids, including pBR322 (Sambrook, J., et
al., Molecular Cloning: A Laboratory Manual, 2d Ed.,
Cold Spring Harbor Laboratory Press, Cold Spring Harbor,
NY (1989)). The Cμ enhancer, which lies in the intron
between exons J and CH1 of the Cμ gene, can be derived
from any one of the cloned Cμ genes (Kawakami, T., et
al., Nucleic Acids Research, 8:3933 (1980); Honjo, T.,
Ann. Rev. Immunol., 1:499 (1983)) and increases levels of
transcription from antibody genes. LTR contains the
viral promoter from the Moloney MLV retrovirus DNA
(Mulligan, R.C., Experimental Manipulation of Gene
Expression, New York Academic Press, p. 155 (1983)).
D represents the PCR primer described in the text,
depicted in its 5' to 3' orientation. The only
constraints on D are its orientation, its complementarity to
pFHC and its order relative to the RE1 and RE2 cloning
sites. Preferably, D is within 100 nucleotides of RE1.
The cDNA cloning site contains restriction endonuclease
sites RE1 and RE2, separated by spacer DNA which allows
their efficient cleavage. The constraints on RE1 and RE2
are described below. The Cμ exons, as described in the
text and literature, direct the synthesis of IgM heavy
chain. Only part of CH1 is present, as described below.
CH3 is chosen to contain the Cμs region which specifies a
secreted form of the heavy chain (Kawakami, T., et al.,
Nucleic Acids Research, 8:3933 (1980); Honjo, T., Ann.
Rev. Immunol., 1:499 (1983)). Finally, pFHC contains
poly A addition and termination sequences which can be
derived from the Cμ gene itself (Honjo, T., Ann. Rev.
Immunol., 1:499 (1983); Kawakami, T., et al., Nucleic
Acids Research, 8:3933 (1980)). One potential advantage
of using the entire Cμ gene is that in some host cell
systems, a membrane bound and secreted form of IgM may be
expressed (Granowicz, E.S., et al., J. Immunol., 125:976
(1980)).

The plasmid can be produced by combining the
individual components, or nucleic acid segments, depicted
in Figure 3, using PCR cassette assembly (See below).
Because the entire nucleotide sequence of each component
is defined, the entire nucleotide sequence of the plasmid
is defined.

The constraints on RE1 are simple. It should be the
sole cleavage site on the plasmid for its restriction
endonuclease. The choice of RE1 can be made by computer
based sequence analysis (IntelliGenetics Suite, Release
5.35, IntelliGenetics).
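The requirement that RE1 be the sole cleavage site on the plasmid can be checked with a few lines of code. The following is a minimal sketch rather than the sequence analysis package named above: it assumes the plasmid sequence is available as a string, treats the plasmid as circular, searches both strands, and supports only the degenerate symbols R, Y and N. All function names and the toy sequences are hypothetical.

```python
import re

# Degenerate symbols used for recognition sequences (R = purine, Y = pyrimidine).
IUPAC_REGEX = {"A": "A", "C": "C", "G": "G", "T": "T",
               "R": "[AG]", "Y": "[CT]", "N": "[ACGT]"}
COMPLEMENT = str.maketrans("ACGTRYN", "TGCAYRN")


def reverse_complement(site: str) -> str:
    """Reverse complement of a recognition sequence, keeping R/Y/N symbols."""
    return site.translate(COMPLEMENT)[::-1]


def site_positions(plasmid: str, site: str) -> list[int]:
    """0-based start positions of `site` on a circular plasmid, either strand."""
    seq = plasmid.upper()
    # Append the first len(site)-1 bases so matches spanning the origin are found.
    extended = seq + seq[: len(site) - 1]
    hits = set()
    for oriented in {site, reverse_complement(site)}:
        # A lookahead pattern reports overlapping occurrences as well.
        pattern = "(?=" + "".join(IUPAC_REGEX[b] for b in oriented) + ")"
        hits.update(m.start() for m in re.finditer(pattern, extended))
    return sorted(hits)


def is_sole_site(plasmid: str, site: str) -> bool:
    """True if the enzyme would cut the plasmid exactly once."""
    return len(site_positions(plasmid, site)) == 1


if __name__ == "__main__":
    toy_plasmid = "TTGAATTCAAGGATCCAAGGATCCTT"  # hypothetical sequence
    print(site_positions(toy_plasmid, "GAATTC"), is_sole_site(toy_plasmid, "GAATTC"))
    print(site_positions(toy_plasmid, "GGATCC"), is_sole_site(toy_plasmid, "GGATCC"))
```

An enzyme for which `site_positions` returns an empty list is a non-cutter of the sequence examined, which is the property used to compile the list of candidate enzymes for RE2.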

The constraints on RE2 are more complex. First, it
must be the sole cleavage site on the plasmid for its
restriction endonuclease, as described for RE1.
Moreover, the RE2 site must be such that when the PCR
product is inserted, a gene is thereby created which is
capable of directing the synthesis of a complete IgM
heavy chain. This limits the choices for RE2, but the
choices available can be determined by computer based
sequence analysis. The choices can be determined as
follows. First, a list of restriction endonucleases that
do not cleave pFHC is compiled (see Table 1).

TABLE 1

Non-Cutting Enzymes for the Mouse Cμ Gene

AatII      AhaII      AseI       AvrII      BglI
BspHI      BssHII     BstBI      ClaI       DraI
EagI       EcoRI      EcoRV      FspI       HgaI
HincII     HpaI       KpnI       MluI       NaeI
NarI       NdeI       NotI       NruI       PaeR7I
PvuI       RsrII      SacII      SalI       ScaI
SfiI       SnaBI      SpeI       SphI       SspI
StuI       Tth111I    XbaI       XhoI



These are called the "rare non-cutters." Next, the
sequence of CH1 is rewritten with "N" at the third
position of each codon and entered into the computer.
This is called the "N-doped sequence" (See Figure 4).
Next, the rare non-cutters are surveyed by computer
analysis for those which will cleave the N-doped
sequence. The search program will show a possible
restriction endonuclease site, assuming a match between N
and the restriction endonuclease cutting site. For
example, with 39 rare non-cutters, 22 will cleave the
N-doped sequence of Cμ CH1, many of them several times
(see Table 2). In this table, "Def" means a definite cut
site, of which there are none, because of the Ns. "Pos"
means a possible cleavage site at the indicated nucleotide
position if N is chosen appropriately. "Y"
indicates any pyrimidine, "R" indicates any purine and
"N" indicates any nucleotide. The nucleotide positions
refer to coordinates represented in Figure 4.

TABLE 2
ENZYME     RECOGNITION                     CUT SITE
                                           Def     Pos

AatII      (GACGTC)                        none    250
AhaII      (GRCGYC)                        none    247
AvrII      (CCTAGG)                        none    204
BspHI      (TCATGA)                        none    138
BssHII     (GCGCGC)                        none    189
EcoRI      (GAATTC)                        none    195
EcoRV      (GATATC)                        none    214
HgaI       (GACGCNNNNN)                    none    284
           (NNNNNNNNNNGCGTC)
HincII     (GTYRAC)                        none    183
HpaI       (GTTAAC)                        none    220
KpnI       (GGTACC)                        none    408
NruI       (TCGCGA)                        none    174
PaeR7I     (CTCGAG)                        none    190
PvuI       (CGATCG)                        none    178
ScaI       (AGTACT)                        none    209
SpeI       (ACTAGT)                        none    131
SphI       (GCATGC)                        none    338
SspI       (AATATT)                        none    371
StuI       (AGGCCT)                        none    149
Tth111I    (GACNNNGTC)                     none    212
XbaI       (TCTAGA)                        none    338
XhoI       (CTCGAG)                        none    190

Further Pos entries, in the order printed (their assignment
to individual enzymes is not preserved): 309, 306, 334, 220,
193, 339, 266, 167, 303, 284, 359, 339
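The survey that produces a table of this kind can be sketched as follows. This is an illustrative reconstruction, not the search program used for Table 2: it N-dopes a coding sequence, then reports every position at which a recognition sequence could be created by a suitable choice of the N positions. The function names are hypothetical, the search covers one orientation only (sufficient for palindromic sites, whereas an enzyme such as HgaI would also need its reverse complement searched), and the positions returned are 0-based start indices rather than the Figure 4 coordinates. The eight-codon input is an assumed reading of the CH1 fragment used in the BssHII example of this section.

```python
# Nucleotides allowed by each symbol (R = purine, Y = pyrimidine, N = any).
IUPAC_SETS = {"A": "A", "C": "C", "G": "G", "T": "T",
              "R": "AG", "Y": "CT", "N": "ACGT"}


def n_dope(coding_seq: str) -> str:
    """Replace the third base of every codon with N (frame starts at index 0)."""
    return "".join("N" if i % 3 == 2 else base
                   for i, base in enumerate(coding_seq.upper()))


def compatible(seq_symbol: str, site_symbol: str) -> bool:
    """True if at least one concrete nucleotide satisfies both symbols."""
    return bool(set(IUPAC_SETS[seq_symbol]) & set(IUPAC_SETS[site_symbol]))


def possible_sites(doped_seq: str, site: str) -> list[int]:
    """0-based starts at which `site` could exist given free choice of each N."""
    k = len(site)
    return [start for start in range(len(doped_seq) - k + 1)
            if all(compatible(doped_seq[start + j], site[j]) for j in range(k))]


if __name__ == "__main__":
    fragment = "GCCATGGGCTGCCTAGCCCGGGAC"  # assumed wild-type CH1 fragment
    doped = n_dope(fragment)
    print(doped)
    for name, site in (("BssHII", "GCGCGC"), ("EcoRI", "GAATTC")):
        print(name, possible_sites(doped, site))
```

A hit reported by `possible_sites` only shows that the fixed first and second codon positions permit the site. Whether the required third-position changes leave the encoded amino acids unchanged has to be checked separately, which is why only a fraction of the possible sites are usable.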
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Most of these cleavage sites (about 60%) are compatible 
with the amino acids specified by CH1. Therefore, it is
possible to mutate CH1 to create a unique site for such
an enzyme without altering the amino acid sequence
encoded by CH1. One sequence which illustrates this is
shown below:

1) ...ala met gly cys leu ala arg asp...

2) ...GCC ATG GGC TGC CTA GCC CGG GAC...

3) ...GCC ATG GGC TGC CTA GCG CGC GAC...
                          -------
                          BssHII

Line 1 represents part of the actual amino acid
sequence specified by the mouse Cμ CH1 gene region, and
line 2 is the actual nucleotide sequence. By changing
the sequence to the indicated nucleotides underlined on
line 3, a cleavage site for the rare non-cutter BssHII is
created. The new sequence (containing the BssHII site)
GCG CGC still encodes the identical amino acid sequence.
Therefore, the sequence of the primer C is chosen to be
the complement of line 3, and RE2 is the BssHII site.
Such a primer will function in the PCR and vector
construction as desired. Other examples are possible,
and the same process can be used in designing vectors and
primers for cloning light chain variable regions.
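The claim that the altered sequence still encodes the same amino acids can be verified mechanically. The sketch below uses the standard genetic code and hypothetical function names; the wild-type sequence is an assumption, namely line 2 read so that its codons agree with the amino acids on line 1, and the mutant sequence is line 3.

```python
# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(dna: str) -> str:
    """Translate complete codons of `dna` (frame 0) to one-letter amino acids."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def creates_site_silently(wild_type: str, mutant: str, site: str) -> bool:
    """True if `mutant` gains `site` while encoding the same protein."""
    return (len(wild_type) == len(mutant)
            and translate(wild_type) == translate(mutant)
            and site not in wild_type
            and site in mutant)


if __name__ == "__main__":
    wild_type = "GCCATGGGCTGCCTAGCCCGGGAC"  # assumed reading of line 2
    mutant = "GCCATGGGCTGCCTAGCGCGCGAC"     # line 3
    changed = [i for i, (w, m) in enumerate(zip(wild_type, mutant)) if w != m]
    print(translate(wild_type), translate(mutant), changed)
    print(creates_site_silently(wild_type, mutant, "GCGCGC"))
```

Both changes fall on third codon positions (GCC to GCG for alanine, CGG to CGC for arginine), which is what the N-doped survey anticipates.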

The choice for primer C puts a constraint on pFHC.
In the example shown, the CH1 region contained on pFHC
must begin at its 5' end with the mutant sequence GCG
CGC. Such mutant fragments can be readily made by the
process of PCR cassette assembly described below.

The process of PCR cassette assembly is a method of
constructing plasmid molecules (in this case the plasmid
pFHC) from fragments of DNA of known nucleotide sequence.
One first compiles a list of restriction endonucleases
that do not cleave any of the fragments. Each fragment
is then individually PCR amplified using synthesized
oligonucleotide primers complementary to the terminal
sequences of the fragment. These primers are synthesized
to contain on their 5' ends restriction endonuclease
cleavage sites from the compiled list. Thus, each PCR
product can be so designed that each fragment can be
assembled one by one into a larger plasmid structure by
cleavage and ligation and transformation into E. coli.
Using this method, it is also possible to make minor
modifications to the terminal sequence of the
fragment being amplified. This is done by altering the
PCR primer slightly so that a mismatch occurs. In this
way it is possible to amplify the Cμ gene starting
precisely from the desired point in CH1 (as determined by
oligo C above) and creating the RE2 endonuclease cleavage
site.
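The primer design step of PCR cassette assembly can be sketched in code. The example below is a minimal illustration under stated assumptions: the cleavage sites are palindromic, a short 5' spacer is added ahead of each site to permit efficient cleavage, and the annealing length is fixed. The function name, the default spacer and annealing length, and the toy fragment are hypothetical and are not taken from the specification.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of an unambiguous DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def tailed_primers(fragment: str, site_5p: str, site_3p: str,
                   anneal_len: int = 20, spacer: str = "GC") -> tuple[str, str]:
    """Design a primer pair that adds a restriction site to each fragment end.

    Each primer reads 5' to 3' as: spacer + cleavage site + annealing region.
    A ValueError is raised if either site already occurs in the fragment,
    since the enzymes must be non-cutters of the fragment itself.
    """
    frag = fragment.upper()
    for site in (site_5p, site_3p):
        if site in frag or reverse_complement(site) in frag:
            raise ValueError(f"{site} cuts within the fragment")
    forward = spacer + site_5p + frag[:anneal_len]
    reverse = spacer + site_3p + reverse_complement(frag[-anneal_len:])
    return forward, reverse


if __name__ == "__main__":
    toy_fragment = "ATGGCCAAATTTGGGACT"  # hypothetical fragment
    fwd, rev = tailed_primers(toy_fragment, "GAATTC", "GGATCC", anneal_len=6)
    print(fwd)
    print(rev)
```

A deliberate mismatch of the kind described above would be introduced by editing the annealing region of one primer, for example so that the amplified CH1 segment begins with GCG CGC.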
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CLAIMS 



1. A process for synthesizing DNA encoding a
family of antigen-combining proteins, comprising the
steps of:

a) obtaining DNA containing genes encoding
antigen-combining proteins;

b) combining the DNA containing genes encoding
antigen-combining proteins with sequence
specific primers which are oligonucleotides
homologous to conserved regions of the genes;
and

c) performing sequence specific gene
amplification.

2. DNA encoding a family of antigen-combining proteins
produced by the process of Claim 1.

3. The process of Claim 1 wherein sequence specific 
gene amplification is performed by the polymerase 
chain reaction. 

4. The process of Claim 3 wherein the sequence specific
primers are bidirectional.

5. The process of Claim 3 wherein the sequence specific
primers are nested unidirectional primers.

6. The process of Claim 1 wherein the antigen-combining
proteins are immunoglobulins.



7. The process of Claim 6 wherein the immunoglobulins
are selected from the group consisting of heavy
chains and light chains.

8. The process of Claim 7 wherein the heavy chains are
μ chains.

9. The process of Claim 1 wherein the DNA containing
genes encoding antigen-combining proteins is cDNA of
RNA from antibody-producing cells.

10. The process of Claim 1 wherein the DNA containing
genes encoding antigen-combining proteins is genomic
DNA from antibody-producing cells.

11. The process of Claim 8 wherein the antigen-combining
proteins are of mammalian origin.



12. The process of Claim 1 wherein the primers are
oligonucleotides homologous to conserved regions of
the constant regions of immunoglobulin genes.

13. The process of Claim 1 wherein the primers are
oligonucleotides homologous to the conserved regions
of the variable regions of immunoglobulin genes.

14. The process of Claim 1 wherein the primers contain
at least one restriction endonuclease cloning site.

15. The process of Claim 1 wherein the primers are
selected from the group consisting of
oligonucleotide B of Figure 2 and oligonucleotide C
of Figure 2.

16. A method of creating a diverse starter library of
DNAs encoding families of antigen-combining proteins
comprising cloning the product of Claim 1 into an
appropriate vector.

17. A diverse starter library of DNAs encoding families
of antigen-combining proteins produced by the method
of Claim 14.

18. The method of Claim 16 wherein the vector is a
prokaryotic vector or a eukaryotic vector.

19. The method of Claim 16 wherein the vector is a viral
vector or a retroviral vector.

20. The method of Claim 16 wherein the vector is a
plasmid.

21. The method of Claim 20 wherein the plasmid is
selected from the group consisting of pFHC and pLHC.

22. The method of Claim 16 wherein the vector is
selected from the group consisting of expression
vectors and cloning vectors.

23. The method of Claim 22 wherein the expression vector
is appropriate for expression of the variable region
of an antigen-combining protein as a chimeric
molecule in register with a framework protein.

24. The method of Claim 23 wherein the framework protein
is an immunoglobulin.



-43- 

25. The nethod of Claim 24 vherein the Immunoglobulin is 
all or a portion of the constant region of the m 
heavy chain. 

26. The aethod of Claim 16 further comprising creating a 
05 collection of viral particles from viral vector- 
based libraries of DNA encoding antigen-combining 
proteins by the process of introducing viral vectors 
into host cells in which they replicate and form 
viral particles. 

10 27. A method of producing a high diversity library of 

DNA encoding families of antigen-combining proteins 
comprising mutagenizlng the product of Claim 16. 

28. A high diversity library of DNA encoding families of 
antigen-combining proteins produced by the method of 

15 Claim 27. 

29. The method of Claim 27 vherein mutagenizlng Is 
carried out by random chemical mutagenesis. 

30. The method of Claim 27 wherein mutagenizing is
carried out by performing the polymerase chain
reaction under limiting nucleotide conditions.

31. The method of Claim 27 wherein mutagenizing is
carried out in such a manner that mutagenesis is 
limited to DNA encoding variable regions of the 
antigen-combining protein. 

32. A process of producing a diverse population of host
cells which comprises introducing into host cells
DNA of the starter library or high diversity
libraries of antigen-combining proteins.

33. Host cells produced by the method of Claim 32. 

34. The process of Claim 32 wherein the host cells are 
prokaryotic.

35. The process of Claim 32 wherein the host cells are
eukaryotic. 

36. The process of Claim 35 wherein the host cells are
selected from the group consisting of immortalized
cultured mammalian cells.

37. The process of Claim 36 wherein the immortalized 
cultured mammalian cells are selected from the group 
consisting of myelomas and plasmacytomas. 

38. The process of Claim 32 wherein the libraries
encoding families of antigen-combining proteins are
introduced into host cells by a method selected from
the group consisting of: electroporation, calcium
phosphate coprecipitation, protoplast fusion, viral
infection, and cell fusion.

39. The process of Claim 32 wherein the libraries of
DNAs encoding families of antigen-combining proteins
are contained in an expression vector.



40. The process of Claim 32 wherein the DNAs encoding
families of antigen-combining proteins encode
antigen-combining proteins selected from the group
consisting of immunoglobulin heavy chain variable
regions or immunoglobulin light chain variable
regions.

41. The process of Claim 40 wherein DNAs encoding
immunoglobulin heavy chain variable regions are
introduced simultaneously with or sequentially to
DNAs encoding immunoglobulin light chain variable
regions.

42. The method of Claim 32 further comprising 
identifying cells which produce antigen-combining
molecules of selected specificity. 



43. The method of Claim 42 wherein identifying of cells 
which produce antigen-combining molecules of 
selected specificity is carried out by assaying
cellular supernatants for antigen-combining
activity.

44. The method of Claim 42 wherein identifying of cells 
which produce antigen-combining molecules of 
selected specificity is carried out by a
nitrocellulose filter overlay technique.

45. The method of Claim 44 wherein cells producing 
antigen-combining molecules of selected specificity 
are enriched for cells producing antigen-combining 
molecules on their surface by affinity matrix
chromatography.



46. Cells produced by the method of Claim 42.

47. Antigen-combining molecules produced by cells of
Claim 42.

48. DNAs encoding immunoglobulin heavy chain variable 
regions or immunoglobulin light chain variable
regions, present in cells of Claim 42.

49. Viruses produced by the method of Claim 26. 

50. A method of isolating viruses of Claim 49 encoding
antigen-combining molecules of selected specificity,
comprising the steps of:

a) infecting host cells with an appropriate virus
containing DNA encoding antigen-combining molecules;

b) coating the virus with antigen-combining
molecules which the virus encodes; and

c) subjecting the product of step (b) to
affinity-matrix selection, to separate the virus
according to the antigen-combining molecules they
contain.

51. Viruses produced by the method of Claim 50. 


52. Antigen-combining molecules encoded by viruses of
Claim 51.